A Generic Training and Classification System Formirex08 Classification Tasks: Audio Music Mood, Audio Genre, Audio Artist and Audio Tag

نویسنده

  • Geoffroy Peeters
چکیده

This extended abstract details a submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2008 for the training and classification tasks audio music mood, audio genre, audio artist and audio tag. The same system has been submitted for the various tasks without any adaptations to the specific problems. The system named ircamclassification is a generic system which performs batch feature extraction, models training (using various classifiers) and file indexing (or file segmentation) into classes. The features extracted are generic in order to be applicable to many different audio and music indexing problems. The features are not specific to the above mentioned MIREX08 tasks. The goal of this submission is to test the applicability of a generic classification system to those tasks. 1 SYSTEM DESCRIPTION ircamclassification is an extension of a system initially developed for instrument-samples indexing described in [3] using the features described in [4]. Only the subset of features applicable to polyphonic audio signals (music) has been used here. In [5] the system has been extended for speech/music segmentation. It is this system that has been used forMIREX08 tasks. We briefly review it in the following. 2 FEATURE EXTRACTION In the present submission, only three sets of audio features are extracted from the signal. MFCC: The first set aims at describing the shape of the spectrum at each time. Mel Frequency Cepstral Coefficients (40 Mel bands, 13 coefficients including DC component) are extracted every 20ms using a Blackman window of length 40ms. SFM/ SCM: MFCCs only describes the shape of the spectrum whatever the content of the signal is noise or sinusoidal (harmonic) components. In order to describe this noise/ sinusoidal content, we also compute height Spectral Flatness [2] and Spectral Crest Measure coefficients. This is done using the same analysis parameters. Chroma/ PCP: The third set of features gives rough information about the meaning of the harmonic content of the signal. For this, twelve Chroma [6]/ Pitch Class Profiles (PCP) [1] coefficients are computed using a Blackman window of length 100ms synchronized in time with the two other feature sets. Delta and acceleration coefficients of the above mentioned features are also computed. Finally, a simple temporal modelling (mean and standard deviation) of each feature is performed using a sliding window of length 500ms and a hop size of 250ms.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Labrosa’s Audio Classification Submissions

We have submitted a system to MIREX 2008’s audio music classification tasks. It employs the spectral features described in [2] in addition to novel stereo-based features. For the n-way audio classification tasks (artist, classical composer, genre, latin genre, and mood identification) it uses a DAGSVM to perform classification. For the tag classification task, it uses a simple binary SVM with P...

متن کامل

Labrosa’s Audio Music Similarity and Classification Submissions

We have submitted a system to MIREX 2007’s audio music similarity and classification tasks. It employs spectral features based on [1] and temporal features similar to those described in [3]. For the similarity task, it calculates the distance between songs as the Euclidean distance between their feature vectors. For the audio classification tasks (artist, classical composer, genre, and mood ide...

متن کامل

Marsyas Submissions to Mirex 2007

Marsyas is an open source software framework for audio analysis, synthesis and retrieval with specific emphasis on Music Information Retrieval. It is developed by an international team of programmers and researchers led by George Tzanetakis. In MIREX 2007 we participated in the following tasks: Audio Artist Identification, Audio Classical Composer Identification, Audio Genre Classification, Aud...

متن کامل

Audio Mood Classification Using Ensemble Classifier with Music Tag Based Indexing

This paper presents a system for audio classification using multiple binary ensemble classifiers with music tag based indexing in a one-versus-all classification scenario. The proposed system has won the audio classification task on mood dataset in MIREX 2010 and is implemented as follows. First, in the training phase, the frame-based 70-dimensional feature vectors are extracted from a training...

متن کامل

Audio Tag Annotation and Retrieval Using Tag Count Information

Audio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. With the explosive growth of digital music available on the Web, automatic audio tagging, which can be used to annotate unknown music or retrieve desirable music, is becoming increasingly important. This can be achieved by training a binary classifier fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008